Natural Language News Generation from Big Data

Authors

  • Bastian Haarmann
  • Lukas Sikorski
Abstract

In this paper, we introduce an NLG application for the automatic creation of ready-to-publish texts from big data. The resulting fully automatically generated news stories closely resemble the style in which a human writer would draw up such a story. Topics include soccer games, stock exchange market reports, and weather forecasts. Each generated text is unique. Ready-to-publish stories written by a computer application can help humans quickly grasp the outcomes of big data analyses, save journalists time-consuming pre-formulation work, and cater to rather small audiences by offering stories that would otherwise not exist.

Keywords: Big data, natural language generation, publishing, robotic journalism.

Similar resources

Text Opinion Mining to Analyze News for Stock Market Prediction

It is a known fact that news and stock prices are closely related and that news usually has a great influence on stock market investment. Many studies have aimed at identifying that relationship or at predicting stock market movements using news analysis. Recently, massive news texts, called unstructured big data, have been used to predict stock prices. In this paper, we introduce a meth...

Using Big Data Opinion Mining to Predict Rises and Falls in the Stock Price Index

In light of recent research that has begun to examine the link between textual “big data” and social phenomena such as stock price increases, this chapter takes a novel approach to treating news as big data by proposing the intelligent investment decision-making support model based on opinion mining. In an initial prototype experiment, the researchers first built a stock domain-specific sentime...

Using Very Large Scale Ontologies for Natural Language Generation

Ontology-based natural language generation can be a decisive factor for digitalization in the publishing, media and content production industries. Based on the technology presented here, in the foreseeable future the amount of generated news will exceed that of news written by human authors. In the future, publicly available data in the domains of weather, sports, finance, traffic, events or...

Natural Language Processing Methods Used for Automatic Prediction Mechanism of Related Phenomenon

The paper presents an idea for combining a variety of Natural Language Processing techniques with different classification methods as a tool for automatically predicting a related phenomenon. Different types of preprocessing techniques are used and verified in order to find the best set of them. It is assumed that such an approach makes it possible to recognize the phenomenon that is related to the text....

Data-Driven News Generation for Automated Journalism

Despite increasing amounts of data and ever improving natural language generation techniques, work on automated journalism is still relatively scarce. In this paper, we explore the field and challenges associated with building a journalistic natural language generation system. We present a set of requirements that should guide system design, including transparency, accuracy, modifiability and t...

Publication date: 2015